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Program Description 1 

Reading Edge is a middle school literacy program that emphasizes 
cooperative learning, goal setting, feedback, classroom manage- 
ment techniques, and the use of metacognitive strategy, whereby 
students assess their own skills and learn to apply new ones. The 
program is a component of the Success for All 2 (SFA)® whole-school 
reform model and provides eight levels of instruction, from begin- 
ning through eighth-grade reading levels. Students are grouped into 
classes based on ability, and whole-class reading instruction is deliv- 
ered in daily 60-minute blocks. Instruction at the early levels uses 
fiction, nonfiction, and simple scripts to help students develop basic 
decoding skills, reading fluency, vocabulary, and comprehension. At 
reading level 3 and higher, students focus on developing comprehen- 
sion strategies using both narrative and expository texts. All levels 
focus on building background knowledge and developing study skills. 
Although the program is often implemented in the context of the 
SFA® whole-school reform, this report focuses on Reading Edge as a 
stand-alone program in grades 4 and higher. 3 
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One study of Reading Edge that falls within the scope of the Adolescent Literacy review protocol meets What 
Works Clearinghouse (WWC) evidence standards without reservations, and no studies meet WWC evidence stan- 
dards with reservations. The one study included 405 sixth-grade students who attended one middle school in Flor- 
ida and one middle school in West Virginia. 5 Based on this one study, the WWC considers the extent of evidence 
for Reading Edge on adolescent readers to be small for one domain: comprehension. Three other domains are not 
reported in this intervention report. (See the Effectiveness Summary for further description of all domains.) 


Effectiveness 

Reading Edge was found to have no discernible effects on comprehension for adolescent readers. 


Table 1. Summary of findings 6 




Improvement index (percentile points) 




Outcome domain 

Rating of effectiveness 

Average 

Range 

Number of 
studies 

Number of 
students 

Extent of 
evidence 

Comprehension 

No discernible effects 

+5 

na 

1 

405 

Small 


na = not applicable 
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Program Information 

Background 

Reading Edge was developed by Robert Slavin and Nancy Madden in conjunction with Johns Hopkins University 
and is the middle school reading component of the SFA® whole-school reform model. Reading Edge is distrib- 
uted by the Success for All Foundation, Inc., 200 W. Towsontown Boulevard, Baltimore, MD, 21204-5200. Email: 
sfainfo@successforall.org. Web: http://www.successforall.org/Middle-High/Powerful-lnstruction/The-Reading- 
Edge-Middle-School/. Telephone: (800) 548-4998 ext. 2372. 

Program details 

Reading Edge is designed to combine explicit instruction in metacognitive reading strategies with peer collaboration, 
regular assessment and feedback, and classroom management techniques that are intended to engage students in 
learning. Although the instruction varies according to the nature of the text and the challenges it presents, every Read- 
ing Edge lesson follows a regular pattern: Setting the Stage, Active Instruction, Teamwork, and Time for Reflection. 

The program is segmented into eight levels. Instruction is delivered in daily 60-minute lessons in which students are 
grouped into separate classes according to their reading levels, regardless of age or grade level. Regrouping allows 
teachers to teach the whole class without having to break the class into multiple smaller reading groups. Students 
are reassessed quarterly to determine their current level of performance. 

Levels 1 through 3 of the program are remedial. Level 1 focuses on fundamental reading skills with an emphasis 
on individual words and improving comprehension. Using a sequence of illustrated stories, level 1 lessons present 
phonetically regular text that becomes more difficult as students master new skills. Levels 2 and 3 focus on fluency, 
comprehension skills and strategies, vocabulary, word recognition, and writing. These levels use simple fiction, 
nonfiction, and stories translated to scripts to teach basic decoding skills and improving reading fluency. Levels 4 
through 8 add story structure, literary techniques, purposes for writing, and the writing process. These levels use 
short stories, novels, poetry, and nonfiction. 

Professional development for Reading Edge teachers is provided in a two-day initial training that focuses on 
instructional strategies, routines, assessments, and cooperative-learning strategies, and provides support for 
administration and district personnel. Ongoing coaching and support are provided in quarterly coaching visits, 
follow-up telephone meetings, and informal telephone support for all staff members. 


Cost 

The Reading Edge program is built around classroom sets of nonfiction and fiction trade books. Schools purchase 
these separately from commercial vendors. Teacher manuals provide 40 weeks worth of detailed lesson plans, 
tests, homework, and vocabulary lists for each reading level. For details on specific product pricing, contact the 
program developer, the Success for All Foundation. 
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Research Summary 

Thirty-seven studies reviewed by the WWC investigated the effects Table 2. Scope of reviewed research 
of Reading Edge on adolescent readers. One study (Chamberlain, 

Daniels, Madden, & Slavin, 2007) is a randomized controlled trial 
that meets WWC evidence standards without reservations. That 
one study is summarized in this report. The remaining 36 studies 
do not meet either WWC eligibility screens or evidence standards. 

(See references beginning on p. 5 for citations for all 37 studies.) 

Summary of study meeting WWC evidence standards without 
reservations 

Chamberlain et al. (2007) conducted a randomized controlled trial 
that examined the effects of Reading Edge on sixth-grade students attending two rural middle schools, one in 
Florida and one in West Virginia. The study used a two-stage random assignment design. All sixth-grade teachers 
were randomly assigned to teach either the Reading Edge program or the normal reading curriculum. All incoming 
sixth-grade students at the two schools were stratified by state reading assessment levels, gender, and ethnicity, 
and then randomly assigned to intervention or comparison teachers. The WWC based its effectiveness ratings on 
findings from comparisons of 203 students in the Reading Edge group and 202 students in the comparison group 
The study reported student outcomes after approximately seven to nine months of program implementation. 

Summary of studies meeting WWC evidence standards with reservations 

No studies of Reading Edge meet WWC evidence standards with reservations. 


Grade 

6 

Delivery method 

Small group/Whole 
class 

Program type 

Curriculum 

Studies reviewed 

37 

Meets WWC standards 
without reservations 

1 study 

Meets WWC standards 
with reservations 

0 studies 
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Effectiveness Summary 

The WWC review of interventions for Adolescent Literacy addresses student outcomes in four domains: alphabet- 
ics, reading fluency, comprehension, and general literacy achievement. The one study that contributes to the effec- 
tiveness rating in this report covers one domain: comprehension. The findings below present the authors’ estimates 
and WWC-calculated estimates of the size and statistical significance of the effects of Reading Edge on adolescent 
readers. 7 For a more detailed description of the rating of effectiveness and extent of evidence criteria, see the WWC 
Rating Criteria on p. 15. 

Summary of effectiveness for the comprehension domain 

Chamberlain et al. (2007) did not find a statistically significant positive effect of Reading Edge on the total Gates- 
MacGintie Reading Test score for students in grade 6. The effect was not large enough to be considered substan- 
tively important according to WWC criteria (i.e., an effect size of at least 0.25). 

Thus, for the comprehension domain, one study showed indeterminate effects. This results in a rating of no dis- 
cernible effects, with a small extent of evidence. 


Table 3. Rating of effectiveness and extent of evidence for the comprehension domain 


Rating of effectiveness 

Criteria met 

No discernible effects 

No affirmative evidence of effects. 

The review of Reading Edge in the comprehension domain had one study showing indeterminate effects. 

Extent of evidence 

Criteria met 

Small 

The review of Reading Edge in the comprehension domain was based on one study that included two schools and 
405 students. 
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Appendix A: Research details for Chamberlain et al. (2007). 

Chamberlain, A., Daniels, C., Madden, N. A., & Slavin, R. E. (2007). A randomized evaluation of the Suc- 
cess for All middle school reading program. Middle Grades Research Journal, 2(1), 1-21. 

Table A. Summary of findings Meets WWC evidence standards without reservations 


Study findings 
Average improvement index 

Outcome domain Sample size (percentile points) Statistically significant 


Outcome domain 

Study findings 
Average improvement index 

Sample size (percentile points) Statistically significant 

Comprehension 

405 students +5 No 

Setting 

The study was conducted in two rural middle schools, one in Florida and the other in West 
Virginia. Both schools served high-poverty, majority-White populations. In the Florida school, 69% 
of the students qualified for free lunch, and 75% of the students were White. In the West Virginia 
school, 50% of the students qualified for free lunch, and 95% of the students were White. 

Study sample 

Recruitment of schools occurred in 2005, and the study was conducted during the 2005-06 
school year. The study used a two-stage random assignment design. All sixth-grade teachers in 
the study schools were randomly assigned to teach either the Reading Edge program (interven- 
tion) or the normal reading curriculum (comparison). All incoming sixth-grade students at the 
two schools were stratified by state reading assessment levels, gender, and ethnicity, and then 
randomly assigned to intervention or comparison teachers. The analysis sample used in the 
study included 203 students in the Reading Edge group and 202 students in the comparison 
group. The study used a delayed treatment design, whereby comparison teachers received the 
Reading Edge materials and training in the fall of 2008 (after the study’s data collection period). 

Intervention 

group 

Students in the intervention group received daily reading instruction from teachers who were 
given materials and training to implement the Reading Edge program. Students were assessed 
and then grouped in classes by their reading level. They were reassessed quarterly to allow 
for movement to other group levels and for targeted instruction. The intervention as imple- 
mented in this study differs slightly from the implementation of Reading Edge in the context 
of the SFA® whole-school reform. Although most SFA® Reading Edge schools use a full- 
time, school-based facilitator, these research sites did not have facilitators. In addition, most 
schools involve a large percentage of their teaching staff in the program; these research sites 
had a limited number of participating teachers. Finally, most schools have one to three com- 
mon reading periods for regrouping per day; these research sites did not have common read- 
ing periods. The study reported student outcomes after approximately seven to nine months 
of program implementation. 

Comparison 

group 

Students in the comparison group received daily reading instruction using “business-as-usual” 
methods and materials. In both schools, comparison students were sorted into comparison 
classes based on fifth-grade reading scores. The comparison group teachers in the Florida 
school used Corrective Reading for students with low reading skills and Scholastic Read XL 
for other students. The comparison group teachers in the West Virginia school used Scholastic 
Read 180 and the Scholastic Read Aloud Anthology for students with low reading skills, Larry 
Bell’s “Twelve Powerful Words” in all reading classes, and a teacher-designed “Literacy Circle” 
for small groups of more proficient students. 
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Outcomes and 
measurement 


Support for 
implementation 


For the pretest, students took the Scholastic Reading Inventory in the fall of 2005. For the 
posttest, the students took the Gates-MacGinitie Reading Test (GMRT) at the end of the 
school year, in the spring of 2006. The study reports total GMRT scores, which are the average 
of scores on two subscales: Comprehension and Vocabulary. For a more detailed descrip- 
tion of the posttest outcome measures, see Appendix B. Findings for the total GMRT can be 
found in Appendix C. Subtest findings can be found in Appendix D. The study also measured 
classroom instructional elements using a classroom observation tool based on the Classroom 
Assessment Scoring System (CLASS), but these measures did not meet inclusion criteria as 
outcomes for the Adolescent Literacy Topic Area. For a more detailed description of these 
outcome measures, see Appendix B. 

Training was provided to teachers for the implementation of Reading Edge. No details are 
given about the extent and timing of the training. 
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Appendix B: Outcome measures for each domain 


Comprehension 

Gates-MacGinitie Reading Test (GMRT), 
total score 

The GMRT is used to assess a student's decoding, vocabulary, and passage comprehension skills. The total 
score is an average of two components that independently assess reading vocabulary and comprehension skills 
(as cited in Chamberlain et at, 2007). 

Reading comprehension construct 

GMRT, Comprehension subtest 

The Comprehension subtest measures each student’s ability to read and understand different types of prose. The 
test contains 11 passages of various lengths and subjects and 48 questions (as cited in Chamberlain et al., 2007). 

Vocabulary development construct 

GMRT, Vocabulary subtest 

The Vocabulary subtest measures each student’s reading vocabulary by asking the student to choose one word 
or phrase that means most nearly the same as a presented word. The test contains 45 questions (as cited in 
Chamberlain et at, 2007). 
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Appendix C: Findings included in the rating for the comprehension domain 





Mean 

(standard deviation) 

WWC calculations 


Study 

Sample 

Intervention 

Comparison 

Mean Effect Improvement 

Outcome measure 

sample 

size 

group 

group 

difference size index p-value 


Chamberlain et al. (2007) a 

GMFtT total score Grade 6 405 students 57.93 

(16.15) 

55.76 

(16.10) 

2.17 

0.13 

+5 

0.06 

Domain average for comprehension (Chamberlain et al., 2007) 



0.13 

+5 

Not 

statistically 

significantly 

Domain average for comprehension across all studies 



0.13 

+5 

Not 

statistically 

significantly 


Table Notes: For mean difference, effect size, and improvement index values reported in the table, a positive number favors the intervention group and a negative number favors 
the comparison group. The effect size is a standardized measure of the effect of an intervention on student outcomes, representing the change (measured in standard deviations) 
in an average student’s outcome that can be expected if the student is given the intervention. The improvement index is an alternate presentation of the effect size, reflecting 
the change in an average student’s percentile rank that can be expected if the student is given the intervention. The statistical significance of each study’s domain average was 
determined by the WWC. For Chamberlain et al. (2007), the effect was not statistically significant (p > 0.05). GMRT= Gates-MacGinitie Reading Test. 

a For Chamberlain et al. (2007), no corrections for clustering or multiple comparisons and no difference-in-differences adjustment were needed. The p-value presented here was 
reported in the original study. The means shown in this table are the adjusted posttest means reported in the study. 
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Appendix D: Supplemental subtest findings for the comprehension domain 


Mean 

(standard deviation) WWC calculations 


Outcome measure 

Study 

sample 

Sample 

size 

Intervention 

group 

Comparison 

group 

Mean 

difference 

Effect 

size 

Improvement 

index 

p-value 

Chamberlain et al. (2007) a 

GMRT-Comprehension 

Grade 6 

405 students 

29.87 

(9.14) 

28.90 

(8.92) 

0.97 

0.11 

+4 

0.16 

GMRT-Vocabulary 

Grade 6 

405 students 

28.05 

26.85 

1.20 

0.15 

+6 

0.05 


(7.92) (8.35) 

Table Notes: The supplemental findings presented in this table are additional subtest findings from the study in this report that do not factor into the determination of the interven- 
tion rating. For mean difference, effect size, and improvement index values reported in the table, a positive number favors the intervention group and a negative number favors 
the comparison group. The effect size is a standardized measure of the effect of an intervention on student outcomes, representing the change (measured in standard deviations) 
in an average student's outcome that can be expected if the student is given the intervention. The improvement index is an alternate presentation of the effect size, reflecting the 
change in an average student’s percentile rank that can be expected if the student is given the intervention. GMRT = Gates-MacGinitie Reading Test. 

a For Chamberlain et al. (2007), a correction for multiple comparisons was needed and resulted in significance levels that differ from those in the original study. The p-values pre- 
sented here were reported in the original study. Due to the multiple comparisons adjustment, the p-value of 0.05 for the Vocabulary subtest of the GMRT was higher than the critical 
p-value of 0.025 for statistical significance; therefore, the WWC does not find the result to be statistically significant. The means shown in this table are the adjusted posttest means 
reported in the study. 
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Endnotes 

1 The descriptive information for this program was obtained from a publicly available source: the program’s website (http://www. 
successforall.org/Middle-High/Powerful-lnstruction/The-Reading-Edge-Middle-School/, downloaded June 2011). The WWC requests 
developers review the program description sections for accuracy from their perspective. The program description was provided to the 
developer in July 201 1 ; however, the WWC received no response. Further verification of the accuracy of the descriptive information for 
this program is beyond the scope of this review. The literature search reflects documents publicly available by August 201 1 . 

2 Success for All (SFA)® is a whole-school reform model that includes reading, writing, and oral language development programs for 
students in prekindergarten through eighth grade. 

3 Studies of Reading Edge conducted in grade 3 and lower are outside of the scope of the Adolescent Literacy review protocol. 

4 The studies in this report were reviewed using WWC Evidence Standards, version 2.1 , as described in the Adolescent Literacy review 
protocol, version 2.0. 

5 The evidence presented in this report is based on available research. Findings and conclusions may change as new research 
becomes available. 

6 For criteria used in the determination of the rating of effectiveness and extent of evidence, see the WWC Rating Criteria on p.15. 
These improvement index numbers show the average and range of student-level improvement indices for all findings across the study. 

7 The level of statistical significance was reported by the study authors or, when necessary, calculated by the WWC to correct for clus- 
tering within classrooms or schools and for multiple comparisons. For the formulas the WWC used to calculate the statistical signifi- 
cance, see WWC Procedures and Standards Handbook, Appendix C for clustering and WWC Procedures and Standards Handbook, 
Appendix D for multiple comparisons. In the case of Chamberlain et al. (2007), a correction for multiple comparisons was needed for 
the subscale scores presented in Appendix C, so the significance levels may differ from those reported in the original study. 

8 The WWC computes an average effect size as a simple average of the effect sizes across all individual findings within the study 
domain. The average effect size across the two subtest measures corresponds to the GMRT total score effect size. 

Recommended Citation 

U.S. Department of Education, Institute of Education Sciences, What Works Clearinghouse. (2012, June). 
Adolescent Literacy intervention report: Reading Edge. Retrieved from http://whatworks.ed.gov. 
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WWC Rating Criteria 

Criteria used to determine the rating of a study 


Study rating 

Criteria 

Meets WWC evidence standards 
without reservations 

A study that provides strong evidence for an intervention’s effectiveness, such as a well-implemented RCT. 

Meets WWC evidence standards 
with reservations 

A study that provides weaker evidence for an intervention's effectiveness, such as a QED or an RCT with high 
attrition that has established equivalence of the analytic samples. 

Criteria used to determine the rating of effectiveness for an intervention 

Rating of effectiveness 

Criteria 

Positive effects 

Two or more studies show statistically significant positive effects, at least one of which met WWC evidence 
standards for a strong design, AND 

No studies show statistically significant or substantively important negative effects. 

Potentially positive effects 

At least one study shows a statistically significant or substantively important positive effect, AND 

No studies show a statistically significant or substantively important negative effect AND fewer or the same number 

of studies show indeterminate effects than show statistically significant or substantively important positive effects. 

Mixed effects 

At least one study shows a statistically significant or substantively important positive effect AND at least one study 
shows a statistically significant or substantively important negative effect, but no more such studies than the number 
showing a statistically significant or substantively important positive effect, OR 

At least one study shows a statistically significant or substantively important effect AND more studies show an 
indeterminate effect than show a statistically significant or substantively important effect. 

Potentially negative effects 

One study shows a statistically significant or substantively important negative effect and no studies show 
a statistically significant or substantively important positive effect, OR 

Two or more studies show statistically significant or substantively important negative effects, at least one study 
shows a statistically significant or substantively important positive effect, and more studies show statistically 
significant or substantively important negative effects than show statistically significant or substantively important 
positive effects. 

Negative effects 

Two or more studies show statistically significant negative effects, at least one of which met WWC evidence 
standards for a strong design, AND 

No studies show statistically significant or substantively important positive effects. 

No discernible effects 

None of the studies shows a statistically significant or substantively important effect, either positive or negative. 

Criteria used to determine the extent of evidence for an intervention 

Extent of evidence 

Criteria 

Medium to large 

The domain includes more than one study, AND 
The domain includes more than one school, AND 

The domain findings are based on a total sample size of at least 350 students, OR, assuming 25 students in a class, 
a total of at least 14 classrooms across studies. 

Small 

The domain includes only one study, OR 
The domain includes only one school, OR 

The domain findings are based on a total sample size of fewer than 350 students, AND, assuming 25 students 
in a class, a total of fewer than 14 classrooms across studies. 
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Glossary of Terms 

Attrition 

Clustering adjustment 
Confounding factor 

Design 
Domain 
Effect size 

Eligibility 

Equivalence 

Extent of evidence 

Improvement index 

Multiple comparison 
adjustment 

Quasi-experimental 
design (QED) 

Randomized controlled 
trial (RCT) 

Rating of effectiveness 

Single-case design 
Standard deviation 


Statistical significance 


Substantively important 


Attrition occurs when an outcome variable is not available for all participants initially assigned 
to the intervention and comparison groups. The WWC considers the total attrition rate and 
the difference in attrition rates across groups within a study. 

If intervention assignment is made at a cluster level and the analysis is conducted at the student 
level, the WWC will adjust the statistical significance to account for this mismatch, if necessary. 

A confounding factor is a component of a study that is completely aligned with one of the 
study conditions, making it impossible to separate how much of the observed effect was 
due to the intervention and how much was due to the factor. 

The design of a study is the method by which intervention and comparison groups were assigned. 
A domain is a group of closely related outcomes. 

The effect size is a measure of the magnitude of an effect. The WWC uses a standardized 
measure to facilitate comparisons across studies and outcomes. 

A study is eligible for review and inclusion in this report if it falls within the scope of the 
review protocol and uses either an experimental or matched comparison group design. 

A demonstration that the analysis sample groups are similar on observed characteristics 
defined in the review area protocol. 

An indication of how much evidence supports the findings. The criteria for the extent 
of evidence levels are given in the WWC Rating Criteria on p. 15. 

Along a percentile distribution of students, the improvement index represents the gain 
or loss of the average student due to the intervention. As the average student starts at 
the 50th percentile, the measure ranges from -50 to +50. 

When a study includes multiple outcomes or comparison groups, the WWC will adjust 
the statistical significance to account for the multiple comparisons, if necessary. 

A quasi-experimental design (QED) is a research design in which subjects are assigned 
to intervention and comparison groups through a process that is not random. 

A randomized controlled trial (RCT) is an experiment in which investigators randomly assign 
eligible participants into intervention and comparison groups. 

The WWC rates the effects of an intervention in each domain based on the quality of the 
research design and the magnitude, statistical significance, and consistency in findings. The 
criteria for the ratings of effectiveness are given in the WWC Rating Criteria on p. 15. 

A research approach in which an outcome variable is measured repeatedly within and 
across different conditions that are defined by the presence or absence of an intervention. 

The standard deviation of a measure shows how much variation exists across observations 
in the sample. A low standard deviation indicates that the observations in the sample tend 
to be very close to the mean; a high standard deviation indicates that the observations in 
the sample tend to be spread out over a large range of values. 

Statistical significance is the probability that the difference between groups is a result of 
chance rather than a real difference between the groups. The WWC labels a finding statistically 
significant if the likelihood that the difference is due to chance is less than 5% (p < 0.05). 

A substantively important finding is one that has an effect size of 0.25 or greater, regardless 
of statistical significance. 


Please see the WWC Procedures and Standards Handbook (version 2.1) for additional details. 
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